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DEDIC21TSD ALU ARCHITECTUllE FOR 10-BIT REED-60L0M0N 
ERROR CORRECTION MODULE 

Field of the Invention 
This invention pertains to the field of error correction in 
digital data processing and storage systems and, more particularly, 
to methods and apparatus for employing a 10-bit Reed-Solomon code 
for the detection and correction of digital data errors within a 
block of data being transmitted between elements of a data 
processing and/or storage system. 

Background of the Invention 
Error correction techniques and architectures are well known 
in digital data processing and communications systems, including 
systems having data storage subsystems such as magnetic, optical, 
or semiconductor based memory stores. Detection and, where 
possible, correction of erroneous data has been achieved by using 
an encoder circuit to construct some number of "redundant" m-bit 
error check symbols, which mathematically characterize the 
• information in a selected block of data. The error check symbols 
are then appended to the data block and transferred or stored 
therewith. When the data block is received, or later retrieved 
from memory, the accuracy of the data can be evaluated by use of 
these appended error check symbols. For example, one methodology 
is to repeat the encoding process on the received or retrieved 
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block of data, often using the same encoder, and then compare the 
newly derived error check symbols with those previously appended to 
the "data block. if the newly derived error check symbols are 
identical to those appended to the data block, the information 
contained in the received data block is presumed to be error free; 
if not, "error syndrome" information is generated from the 
difference between the newly derived and originally appended error 
check symbols, which, depending upon the nature and amount of the 
error check information, can be used to locate the error (s) within 
the data block and determine the correct data values for 
substitution therefor. 

The use of Reed-Solomon codes has become a prevalent 
methodology for performing error detection and correction in 
digital communications, processing and storage applications. In a 
Reed-Solomon code, sequential m-bit data symbols forming a data 
block are treated as being representative of coefficients of a 
polynomial in a variable, e.g., "x". in particular, a sequence of 
k m-bit data "message" symbols {m^ m,, m,, . . ^ , v^t } are treated 
as a "message polynomial," m(x) , of degree k-1, where 

m(x) = moX^» + m,x*'2 + . . . + j^^^x + m^.,, 
An encoder divides the message polynomial, m(x) , by a selected 
"generator polynomial," g{x), to produce a "remainder polynomial," 
r(x), having coefficients in the form of r m-bit error check 
symbols . 



- 2 - 
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A Reed-Soloxnon generator polynoraial, g(x), has 1:he general 

form: 

r-l 

where a ("alpha") is a primitive element of a Galois field, GF{q), 
whose q-l powers, {a^a^a^. . .^a^^}, exhaust the non-zero elements 
of the field and where is some arbitrary logarithm base a of the 
first root of the generator polynomial. Usually, in binary 
systems, q is a power of 2, i.e. , q = 2«, for some m. For example, 
where r = 4, — i.e., where four redundant error check symbols are 
to be generated ~, the generator polynomial may be expressed in 
the factored form: 

g(x) = (X - a")(x - ft»+M(x - o^'^^'Xx - a"**). 
As can be seen, the nature of the generator polynomial, g(x) , 
determines, among other things, the extent and complexity of the 
error correction code. In particular, the degree of the generator 
polynomial determines the number (r) of error check symbols 
contained in the remainder polynomial, r(x). However, the greater 
the degree of the polynomial that is selected for the generator 
polynomial, the more complex the associated encoder circuitry must 
be to perform the necessary computations. 

Generally, the error check symbols are appended to the message 
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symbols to form a Reed-Solonon "code word" of length n - k + r ra- 
bit symbols, where the message and error check symbols are elements 
in the finite Galois field GF(2-). This is referred to as an "n- 
symbol codeword," consisting of "m-bit symbols," or as an example 
of an "m-bit Reed-Solomon implementation." By definition, each 
code word, when considered as a polynomial. c(x) , of degree n-1. 
Where 

C(X) = CoX^» + C,X*» + . . . + c,,X + c^„ 

is evenly divisible by the generator polynomial, g(x). in other 
words, c(x) mod g(x) = o. since a correctly received code word, 
c'(x), will also be evenly divisible by g(x) , one well known 
procedure for detecting errors upon receipt or retrieval of a code 
word, c'(x), is to divide it by g(x) . If c'(x) mod g(x) is zero, 
then c'(x) is presumed to have been correctly received. if the 
remainder of • c (x) /g(x) is non-zero, then one or more errors have 
occurred and an error correction routine is invoked by calculating 
r m-bit "error syndromes," S. . . . s„„ respectively, (i.e., 
"symptoms of error"), from the non-zero remainder of c'(x)/g(x). 

several procedures are known for computing the error locations 
and proper values from the calculated error syndromes. These 
techniques involve error-locator-polynomial determination, root 
finding for determining the positions of the errors, and error 
value determination for determining the correct bit-pattern of the 
errors. In many error detection and correction architectures, the 
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block of data being evaluated is temporarily stored in a buffer 
memory, while the error correction procedure is performed with the 
error syndromes generated from the received, or retrieved/ code 
word. Known architectures may perform the error correction 
routines by using hardware circuitry under the control of a 
programmed state machine or arithmetic logic unit, by using a 
microprocessor under firmware control, or by some combination of 
both. For example, several known architectures perform the 
correction of a very small number of errors, typically no more than 
one or two, by hardware circuitry "on-the-f ly,« i.e., without 
stopping or substantially slowing the bit-rate of the data blocks 
during a typical transfer, or retrieval, of multiple blocks of 
data. The correction of more than these one or two errors, if 
enabled by sufficient error syndrome information, is typically 
15 given to a system level microprocessor to carry out the error 
correction operations on an as-needed, or "interrupt" basis, i.e., 
where the data flow is temporarily slowed, or stopped. When this 
happens, the microprocessor is fed the error correction syndrome 
information for a data block having the more than one or two errors 
and calculates values for locating and correcting all of the errors 
under direction of error correction firmware. In the event the 
data is not correctable, i.e., where there are more errors detected 
in the block than can be corrected by the hardware and/or firmware 
error correction architecture, an error recovery procedure may be 



20 
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attempted, e.g., which nay involve one or more retries to recover 
data from the storage media. 

one general limitation of Reed-Solonon codes is that the 
maximum number of m-tait data symbols that can be corrected within 
an m-bit code word is limited to the integer portion, or "floor," 
of the number of appended error check symbols (r) divided by two, 
i.e. , lNT(r/2) . Another general limitation is that each m-bit code 
word is limited to a maximum length of 2- - i symbols, sometimes 
referred to as its "natural block length," including both the 
source data symbols and the appended error check symbols. Thus, 
implementations of Reed-Solomon error correction techniques must 
take into account these limitations. 

For example, commonly owned U.S. Patent No. 5,241,546, issued 
to Peterson et al., discloses a system architecture which employs 
a Reed-Solomon code to detect and correct data errors in a disk 
drive data storage system which handles data in blocks, or 
"sectors," containing 512 8-bit symbols (or "bytes"), dtl], d[2], 
• . d[512], respectively. Because the total number of 8-bit 
symbols that can be protected in a single 8-bit code word is 2* - 
1 = 255 symbols, each 512 byte data sector is divided into smaller 
blocks for purposes of performing error correction, in so doing, 
the Peterson et al. Patent also takes into account that data errors 
encountered in certain digital data communication and storage 
systems, such as in magnetic based disk drive systems, are often 
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"bursty" in nature, i,e. , the data errors tend to appear across 
consecutive bit positions within a block of data, as opposed to 
occurring randomly. The 512 data bytes are therefore serially 
divided into three separate "interleaves," each interleave 
5 containing every third data byte, ~ i.e., with interleave 1 
containing data bytes d[l], d[4], dt7], . . ., d[511J, interleave 
2 containing data bytes d(2], d(5l, d[8], . . d[5i2], and 
interleave 3 containing data bytes d(3], d[6J, d(9J, . . . , 
d[510], respectively. Each interleave is then separately encoded 

10 to form a corresponding 8-bit Reed-Solomon code word. 

By distributing successive data bytes into separate code words 
for purposes of error correction, the 3-inter leave architecture 
offers the advantage of, for example, treating the occurrence of an 
"error burst" of three successive corrupted data bytes as, in 

15 effect, a single byte error occurring in each code word. In other 
words, because of the substantial likelihood that data errors, if 
any, will occur in consecutive data symbols, the 3 -interleave 
architecture increases the probability that encountered symbol 
errors will be distributed evenly into separate code words, thereby 

20 facilitating correction of a greater number of total symbol errors 
per data sector, without requiring additional error correction 
capability for a given code word. 

The advantage of an interleave architecture is lost, however, 
if the communication, data processing, and/or storage system 
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application is not prone to bursty error patterns. For example, in 
a digital data system employing a semiconductor based memory in 
which error patterns occur in a substantially random manner, 
implementation of an interleave architecture may actually result 
in a substantial portion of the allocated error check redundancy 
being ineffective, since encountered errors are much less likely to 
be evenly distributed across the interleaves. Moreover, if the 
number of errors occurring in a single interleave exceed the error 
correction capability of that code word, the entire data sector may 
be lost, even if the total number of errors occurring sector-wide 
would otherwise have been within the allotted error correction 
capability had they been distributed evenly across all interleaves. 

Thus, in a non-bursty applications it would be desirable to 
fully utilize all allocated error check redundancy to perform error 
correction for an entire data block or sector, e.g. , by using of a 
single Reed-Solomon code word, regardless of the particular symbol 
size format of the data block. 
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Summary of the Invention 
The present invention provides a Reed-Solomon error correction 
methodology and architecture which is suited for digital data 
communication, processing and/or storage systems encountering 
substantially random error patterns, such is characteristic of a 
data storage and retrieval system employing a semiconductor based 
memory store. 

A general object of one aspect of the present invention is to 
provide a Reed-solomon error correction methodology and 
architecture ideally adapted to a mass production environment, 
which minimizes hardware components and the related production 
costs associated therewith. 

A general object of another aspect of the present invention is 
to provide system architecture for implementing a 10-bit Reed- 
Solomon code for detecting and correcting data errors in a single 
code word to protect a data block containing up to 1023 lO-bit data 
symbols, i,e., the equivalent of up to 1278 8-bit symbols, 
including error check redundancy, 

A general object of yet another aspect of the present 
invention is to provide a Reed-Solomon error correction 
architecture which maximizes the use of all allocated error 
correction overhead for an entire block of data, regardless of the 
particular error pattern characteristics encountered in a given 
system application. 
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A general object of yet another aspect of the present 
invention is to provide a practical implementation of a 10-bit 
Reed-Solomon error correction code by employing 5-bit extension 
field operations over a Galois field (2^) to perform certain, 
requisite arithmetic functions with a hardware-minimized error 
correction architecture, 

A more specific object of the hardware-minimized 
implementation aspect of the present invention is to provide a 10- 
bit Reed-Solomon error correction methodology and architecture 
which employs 5-bit extension field operations utilizing a Galois 
field GF(2*) generated from the generator polynomial g(x) « x^ + x' 
+ 1, over GF(2) , 

Yet another more specific object of the hardware-minimized 
implementation aspect of the present invention is to provide a 10- 
bit Reed-Solomon error correction methodology and architectxire 
which generates the Galois field GF(2'®) using the irreducible 
polynomial P,cD4(y) « + y + 1, over a Galois field GF(2^) generated 
from the generator polynomial g(x) = x* + x^ + 1, over GF(2) • 

In accordance with these and other objects, features and 
aspects of the present invention, a data block of m-bit data 
symbols is formatted into a sequence of k 10-bit data symbols and 
passed through an encoder which constructs a preselected number r 
10-bit error check symbols, where k + r < 1023. The error check 
symbols are converted back into an m-bit symbol format and appended 

- 10 - 
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to the original m-bit data block for transport or storage. Upon 
being received, or later retrieved from storage, the m-bit data 
block, including the appended error check symbols, is again 
formatted as a sequence of 10-bit symbols and passed through a 
5 decoder/ syndrome generator, which generates r lo-bit error 
syndromes. If any of the error syndromes are non-zero, thereby 
indicating the existence of one or more lO-bit symbol errors, the 
error syndromes are fed into a error correction module, which, 
using a Galois field arithmetic logic unit ("GF-ALU") under the 

10 control of a programmed state machine , conducts 10-bit Reed-Solomon 
error correction operations (i.e., over a 10-bit Galois field 
GF{2'**)), preferably to determination the location and correct 
values of up to two 10-bit symbol errors by hardware solution and 
up to four 10-bit symbol errors under firmware control. 

15 According to one aspect of the present invention, the GF-ALU 

performs certain arithmetic functions, preferably including at 
least the 10-bit multiplication and inversion functions, 
respectively, by 5-bit extension field operations over a Galois 
••sub field" GF(2') , generated by using the irreducible polynomial: 

20 Pn(x) = x^ + x' + 1, over Gr(2), 

with each 5-bit nibble represented in a standard basis as a fourth 
degree polynomial of "x'^; i.e., where a 5-bit nibble {ca4,d3,d2,d,,do} 
is represented as coefficients in the polynomial: 
d^x^ + djx' + djX^ + d,x +do. 

- 11 - 
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Addition Of the 5-bit nibbles is preferably performed through 
a bit-wise XOR operation. Multiplication is preferably performed 
as Standard polynomial multiplication, reduced modulo P„(x) . 

According to another aspect of the present invention, the 
field GF(2'«) is generated using the irreducible polynomial: 

Pioi4(y) = + y + 1, over GF(2'), 
with the 10 bit symbols represented in a standard basis as a sum of 
the powers of "y" over GF(2'), i.e., with each 10-bit symbol treafd 
as two 5-bit nibbles (e„ e.) , which represents e,y + eo, to perform 
the requisite arithmetic functions. Addition of the 10-bit symbols 
is preferably performed through a bit-wise XOR operation. 
Multiplication is preferably performed as standard polynomial 
multiplication, reduced by modulo P,^(y). m one preferred 
embodiment, the element a = (loi)h (i.e., {0100000001} binary), 
15 serves as a primitive element of GF(2'*). 

In this manner, lo-bit arithmetic operations are accomplished 
by concatenating the result of 5-bit extension field operations 
performed on the most significant and least significant 5-bit 
nibbles, respectively, of the respective lo-bit symbols. 

These and other objects, aspects, advantages and features of 
the present invention will be more fully understood and appreciated 
by those skilled in the art upon consideration of the following 
detailed description of a preferred embodiment, presented in 
conjunction with the accompanying drawings. 



20 
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Brief Des cription of the Drawing s 

It is to be understood that the accompanying drawings are 
provided for the purpose of illustration only, and are not intended 
as a definition of the limits of the invention. The drawings 
illustrate both the design and utility of a preferred embodiment of 
the present invention, in which: 

FIG. 1 is a functional block diagram of the host-to-memory 
(write) circuit of a semiconductor based data storage and retrieval 
system embodying aspects of the present invention; 

FIG. 2 is a functional block diagram of the memory-to-host 
(read) circuit of the semiconductor based data storage and 
retrieval system shown in FIG. l; 

FIG. 3 is a flow chart depicting a preferred error correction 
process ; 

FIG. 4 is a block diagram illustrating the functional system 
architecture of a preferred error correction module; and 

FIG. 5 is a block diagram illustrating the functional system 
architecture of a preferred Galois field arithmetic logic unit 
within the error correction module of FIG. 4. 



Detailed Description of the Preferred Embodiment 
Referring to FIG. 1, a host data processing system 10 
packetizes a block of digital data for storage in an adjunct data 
storage and retrieval system 12. While particular blocks of data 
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being transported by the host system 10 to the storage system 12 
may vary in size and format with differing applications and 
operating system architectures, for purposes of describing the 
illustrated preferred embodiment, a commonly used data block size 
is selected, which contains 256 16-bit data symbols for a total of 
4096 bits. A host interface circuit 14, such as. e.g., a SCSI or 
IDE interface device, receives the data block from the host system 
10. The interface circuit 14 then transmits the data block through 
a 16: 10 bit wide conversion circuit 16, which serially reformats 
the 256 16-bit data symbols into 410 10-bit data symbols, i.e., a 
total of 4100 bits, with the final four bits of the 4l0th data 
symbol comprising added zeros. The data block is also transmitted 
by the host interface circuit 14 directly into to a holding buffer 
18, Without reformatting. 

In alternate embodiments, the data block 12 may be packetized 
in varying bit formats by the host system 10 and/or host interface 
circuit 14. For example, the same size data block (i.e., 4096 
bits) may be packetized in a "byte-size" format as 512 8-bit data 
symbols, or "bytes," as well, in this later case, the bit-wide 
conversion circuit 16 would be designed to reformat the 512 8-bit 
data symbols into the 410 10-bit data symbols, i.e., at an 8:io 
ratio instead of a 16: 10 ratio. Alternatively, the data block may 
be originally packetized by the host system 10 in a 10-bit symbol 
format, which would eliminate the need for the bit-wide conversion 
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circuit 16 within the data storage and handling system 12, 

Whatever the original symbol bit-format of the data block, 
once converted into 10-bit format, the individual data symbols are 
transmitted through a 10-bit Reed-Solomon encoder circuit 20, which 
generates a preferred number r of i 0-bit error check symbols based 
on a preferred generator polynomial, g(x), where r is equal to the 
polynomial degree of g(x). For purposes of describing the 
illustrated preferred embodiment, wherein correction of up to two 
10-bit data errors by direct hardware solution and up to four 10- 
bit errors by firmware solution is preferred, an exemplary 
generator polynomial g(x} of degree r « 8 has been selected, where 
(in factored form) : 

g(x) = (X + a®) (X + fl£*) (X + a') (x + a') (x + o^)* 

*(x + a^) (X + a*) (x -f a'') • 
While any number of known Reed-Solomon encoder circuits may be 
utilized to generate the error check symbols, e.g., such as an 
eight stage linear shift register with feedback, a combined 
encoder /syndrome generator circuit 20 of the type disclosed in an 
article entitled **A Combined Reed-Solomon Encoder and Syndrome 
Generator with Small Hardware Complexity," by G. Fettweis and M, 
Hassner, p 1871-74, IEEE, 0-7803-0593-0/92 (1992), is preferably 
employed. An embedded system microprocessor 22 directs the 
combined encoder /syndrome generator circuit 20 to switch to "encode 
mode," wherein the encoder circuit 20 divides the 410 10-bit data 
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symbols of the data block by g(x) to produce a "remainder" 
comprising 8 10-bit error check symbols. 

The 8 error check symbols are transmitted through a 10:16 bit 
wide conversion circuit 24, which serially reformats the 8 lo-bit 
error check symbols into 5 16-bit symbols. The 5 16-bit symbols 
are then appended to the 256 16-bit data symbols of the original 
data block in buffer 18, i.e., which were transmitted directly from 
the host interface circuit 14, the 261 16-bit symbols forming a 
code word- The code word is released from buffer 18 by the system 
microprocessor 22 and is stored in a semiconductor based memory 
store 26, such as, e.g., a ••FLASH" RAM. m alternate 
configurations and embodiments, the data block may be transmitted 
directly from the host interface circuit 14 into the memory store 
26, i,e,, without employing a holding buffer, with the reformatted 
(16-bit) error check symbols appended thereto directly in the 
memory store 26. 

Referring to FIG. 2, the 261 16-bit code word is retrieved 
from the semiconductor memory store 26 and transmitted into a 
holding buffer 28, which may be the same buffer as buffer 18. The 
code word is also transmitted through a 16:10 bit-wide conversion 
circuit 30. Preferably, holding buffer 28 retains .only the first 
256 symbols of the code word, i.e., only the symbols comprising the 
original source data and not the 5 error check symbols, while all 
261 symbols are passed through the 16:10 bit-wide conversion 
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circuit 30, The bit-wide conversion circuit 30 reformats the 261 
* 16-bit code word into a 418 * 10-bit codeword, ~ i.e., the 410 
10-bit source data symbols, including the added four zeros in the 
410th symbol, plus the 8 10-bit error check symbols. 

The 418 10-bit symbol code word is passed through the combined 
encoder/syndrome generator 20, which, having been switched to 
"syndrome" mode by the system microprocessor 22, generates eight 
10-bit error syndromes. So . . - S7, respectively, therefrom. Other 
syndrome generation circuitry may be alternatively be employed, 
such as conventional DFT circuits, or the like. The error 
syndromes are evaluated in the encoder/ syndrome generator 20 and, 
if all syndromes are determined to be zero, (i.e., 80 consecutive 
zero bits) , the retrieved data block is presumed to be error free 
and the system microprocessor 22 signals the holding buffer 28 to 
release the 256 i6-bit data symbols to the host interface circuit 
14. The host interface circuit 14 then transmits the data block to 
the host system 10 for use in its intended application. 

If, however, one or more of the 8 10-bit error syndromes is 
non-zero, the syndromes are supplied to an error correction module 
36 for determination of the location and correct values of the 
symbol (s) in error. As depicted in the flow chart in FIG. 3, the 
error correction module 36 executes the "direct solution," or 
"Peterson Gorenstein Zierler" algorithm, for finding the 
location (s) and value (s) of up to two 10-bit symbol errors using a 
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hardware solution. If the existence of more than two errors is 
detected during the error correction routine, the error syndromes 
are transferred to the system microprocessor 22, which, under 
firmware control, can determine the correct locations and values of 
up to four 10-bit symbol errors. 

Referring to FIG. 4, the error correction module 36 operates 
under the control of a programmed state controller 42, which 
directs a specialized 10-bit Galois field arithmetic logic unit 44 
("GF-ALU"), to perform the requisite addition, multiplication, 
inversion, squaring and table look-up operations necessary to 
execute the error correction algorithm depicted in FIG. 3, Eight 
registers, - R,, 46a-h, respectively, are provided for holding 
interim results, with register load enables 48 (not shown in 
detail) provided to control which register is loaded with the 
15 result of the operation from the particular preceding instruction 
of the state controller 42. In the illustrated preferred 
embodiment, register Ro, 46a, is a dedicated source operand for 
addition operations and register R,, 46b, is a dedicated source 
operand for multiplication operations, respectively. Registers R^, 
46e, and Rj, 46f , respectively, are used for storing the calculated 
error location (or "locator") values, and (if double error) X,, 
respectively. Registers R^, 46g, and R,, 46h, respectively, are 
used for storing the calculated error values, Yo and (if double 
error) Y,, respectively. It may be possible to use a different 
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register configuration in alternate embodiments, depending on the 
order of steps performed. 

• The following tables list the preferred state instructions 
given by the state controller 42 to the GF-ALU 44. Table 1 lists 
the state instructions for the "Double Error Decoding" process, 
Table 2 includes the state instructions for "Double Error Checking" 
process, and Table 3 includes the state Instructions for "Single 
Error Decoding and Checking" process, respectively. In the state 

tables, "St " identifies each given state; "src" identifies the 

"source" register or error syndrome, respectively, which contains 
or comprises the input data to be used in the specified operation. 
"Inst" refers to the particular instruction, or mathematical 
operation to be performed by the GF-ALU 44, where "load" calls for 
loading the input value into a specified register, "mult" calls for 
multiplying the input value with the contents of register R^, "sqre" 
calls for squaring the input value, "accm" calls for adding the 
input value to the contents of register Rq, and "noop" calls for a 
branch to another state if a specified zero or non-2ero value is 
determined. The "dec" entry refers to the specific register 
location, if any, in which the resulting value of a given state 
operation is to be stored. The new contents, if any, of the 
registers following each state operation are also included, as is 
a brief comment to explain, if necessary, the operation that was 
performed. 
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branch if nonzero (o decoder fails 
branch if nonzero lo decoder fails 

branch if nonzero (o decoder fails 

set status for successful single error 
decoding 

noop unconditional branch to state 67 

set status to indicate decoder failure 

load X« value into locator output 
register 

load value into error value output 
register 

decoder complete - idle state 
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15 



20 



In the preferred state instructions set forth in Tables 1-3 
respectively, there are a total of 106 possible instructions (or 
-states..) to be carried out during execution of the error 
correction algorithm.. The state controller 42 will continue serial 
execution of instructions, until either a double or single error 
solution is completed, or until both fail, with the actual number 
Of states depending upon which event occurs, if either the double 
or single error decoding process is successfully completed, both 
the location(s) of the lo-bit sy»bol error(s), x, and (if double 
error) X„ respectively, and the correct 10-bit data value(s) for 
substitution therefor, and (if double error, y., respectively, 
are obtained fron. the respective registers - r,, 46e-h, by the 
system microprocessor 22. m the event both double and single 
solutions fail, the existence of more than two errors is presumed 
and the eight error syndromes, - s,, respectively, are 

transferred to the system microprocessor 22, which preferably can 
calculate the locations and correct values of up to four lo-bit 
symbol errors under firmware control. 

Whether calculated by the GF-ALO 44, or by the system 
microprocessor 22, the corrected data location(s, and value(s,, 
respectively, are substituted for the erroneous data value(sj by 
the system microprocessor 22. while the retrieved data block is 
still retained in the holding buffer 28. Because the location (s) 
and correct value(s,, respectively, are calculated based on the 10- 
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Wt .y«»l fo™,t, th. -icroprocessor » „,„.l,t.s th. 1.- 

b« locatic„,s, .„a values, into th. co„«p„„«„, xe-Mt 
locitionis, and v.l„.,.,. sul»tit„ti.» i„ the .at, bloc- 

The data hloc. ia then ralaasad fron the hoI«„, h«,„ to the 
hct interface circuit 1. and host .,.t„ „sp.ctival,, ,or „.e 
m Its intended application. 

Referring to KXC. s. the state controller 42 serially invo.es 
each new CP-ALU instruction « to the CP-^„ , ^^^^ 

contained either in one of registers, • . . K„ .ea-h 
respectively, or comprising a particular error syndrome, s, 
S,, respectively, is used as the input value for the operation 
indicated in GF-ALU instruction 45. possible xo-Mt 

arithmetic operations, i.e. over a Galois field GPf^'o, or 
"GP(1024,,« include addition 5., which is preferably carried olt by 
X'OR operation, multiplication 54, inversion 56, and squaring 58. 
respectively, as indicated in the particular state instruction. A 
* y c solution table look-up function 60 is also provided in 
the GP-ALU 44, for finding the quadratic solutio„(s, i„ the Galois 
field GP(2.0) .or a given value of c in the double error decoding 
process . 

After each instruction, the gp-alu output 49 is loaded into 
one Of registers, . . . R„ 46a-h, respectively, which is readied 
by a destination address 47 sent by the register load enables 48 
The GP-ALU output 49 is occasionally checked for a zero or non-zero 
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value by a zero detect circuit 62, in order to verify whether a 
particular single or double error solution attempt has succeeded or 
failed, respectively. if either a zero or non-zero value is 
detected, depending upon the particular state, a status bit or 
"flag.. 66 is sent to the state controller 42, which will set the 
appropriate status, e.g. .-decoder fails,.' or "successful 
double/single error decoding,.- to alert the system microprocessor 
22 to obtain the requisite information from either the syndrome 
generator (if "decoder fails") or the appropriate registers R, - R,, 
46e.h, respectively, m certain state operations, the failure tl 
locate a viable solution in the look-up table 60 will also trigger 
a status flag 66 to be sent. 

In accordance with one aspect of the present invention, the 
operations of the GF-ALU 44 are preferably substantially hardware- 
minimized by employing 5-bit extension field operations over a 
Galois field GF(2»), or GF(32). to assist in performing certain of 
the 10-bit arithmetic operations. m the illustrated preferred 
embodiment, the 5-bit extension field operation units include 5-bit 
multipliers 55, and inverters 57, respectively. 

In the 5-bit operations, each 5-bit nibble or "number" is 
treated in a standard basis as a fourth degree polynomial, i.e.. 
Where each 5-bit number, {d„d„d„d„d„} , is represented as 
coefficients in the fourth degree polynomial, d.x' + d,x^ + d,x' + d,x 
+doX». The Galois field GF(2'), or GF(32) , which is defined by an 
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arbitrarily selected first element and a fifth degree generator 
polynomial, designated herein as «p(x)," is preferably generated 
based on the irreducible polynomial x» + + i. its first element 
a' (i.e., excluding o») , is selected to be equal to x, where x 
represents the binary field element in polynomial representation 
over GF(2), as is known in the art. The 31 possible non-zero field 
elements are represented by the successive powers of alpha, 
including a". Accordingly, each successive element in the field may 
be determined by multiplying the preceding field element by a', mod 
P(x) . Put another way, the non-zero field elements are represented 
by the antilogs of the powers of a from 0 to 31, where each of the 
antilog values are calculated mod p(x) , so that no antilog value 
can exceed 31, with o" mapping back to o», (i.e. , a^' = o« - i) . For 
example: 

= x' mod p(x) = 2' = (binary) 00010; 
tt^ = a' o' mod p(x) = x' x' nod p(x) » x' » oOlOO; 
ftJ - et^-o* mod (p(x) - 01000; 
a* ■ et'-e' mod (p(x) = lOOOO; 

a* = a^ a« « x* x' mod p(x) - x* nod (x^ + x» +1) » 

X* + 1 = 00101 (etc.) . 

After the GF(32) antilog table is calculated, generating a log 
table ("base a") is simply a matter of reversing the antilog table 
by mapping each power of alpha with its corresponding antilog 
value. Hardware multipliers, inverters and adders for GF(32) with 
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a» = X and p(x) = + + i over GF(2), are maintained within the 
GF-ALU 44 to facilitate the 5-bit extension field operations. To 
facilitate further explanation of the 5-bit extension field 
operations, the antilog and log tables (decimal) for GF(2*) are set 
5 forth in Tables 4 and 5: 

TABLE 4. Antilog Table/Elements of GF(2^. 
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a- 
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a* 
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16 


27 
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17 


19 
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TABLES, Log Table for GF(2*) 
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LOGpJ 




0/31 


17 


10 


2 


1 


18 


30 


3 


18 


19 


17 


4 


2 


20 


7 


5 


5 


21 


22 1 


6 


19 


22 


28 


7 


11 


23 


26 


8 


3 


24 


21 




29 


25 


25 


10 


6 


26 


9 


11 


27 


27 


16 


12 


20 


28 


13 


13 


8 


29 


14 


14 


12 


30 


24 


15 


23 


31 


15 


16 


4 







Preferably, the system microprocessor 22 is also provided with 
appropriate Galois field logs and antilogs to facilitate 
nations for the three and four error correction routines. 

Once the results for GF(2») are determined, the lo-bit 
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multiplication and inversion operations may be easily performed by 
5-bit "extension field" operations. In particular, according to 
another aspect of the present invention, the 10 bit numbers are 
represented in a standard basis as a sum of powers of **y'* over 
5 GF(2*), with each lO-bit symbol treated as two 5-bit nibbles or 
"numbers," (eo, e,), which represent e,y + e©, to perform the 
requisite multiplication and inversion functions. Preferably, the 
multiplication of the 10-bit numbers is performed as standard 
polynomial multiplication, reduced by mod PiowCy), where P(y) is the 
10 irreducible polynomial: P,o24(y) - y' + y + i, over GF{2*) . 

By way of example, let m© = 0101110100 and m, = lOOllOllli. A 
10-bit multiply of m^ * m, is required. According to this aspect of 
the invention, it is carried out as follows: 
mo = {010li}y + {10100}; 
15 mj « {I0011}y + {01111}. 

Letting {01011} « a (= (ll)dec); 

{10100} = b (= (20) dec) ; 
{10011} = c (= (19) dec); and 
{01111} = d (= (15)dec); 
20 mo * m, = (ay + b) (cy + d) = acy^ + (be + ad)y + bd. 

(acy^ + (be + ad)y + bd) mod (y' + y + 1) = 

(ac + be + ad)y + (bd + ac) . 
Each of the two 5-bit products, (i.e., ac, be, ad, bd) , may be 
determined by referring to the previously generated antilog and log 
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(base a) tables, as follows: 

ac = antilog„[ (log^a + log.c) mod (31)] 

= antilog[(27 + 17) mod (31)] 
= antilog[i3] = 28. 
5 be = antilog.I (log„b + log.c) nod (31)] 

= antilog[(7 + 17) mod (31)] 
« antilog[24] « 30. 
bd = antilog^((log„b + log.d) mod (31)] 

« antilog[(7 + 23) mod (31)] 
= antilog[30] = 18. 
ad = antilog^t (log^a + log^d) mod (31)] 

= antilog[(27 + 23) mod (31)] 
= antilog[19] « 6. 
Addition of the five bit products is performed by standard XOR 
15 operation; 

(ac + be + ad) = {11100} + {llllO} + {00110} = {00100} ; and 
(bd + ac) « {10010} + {11100} = {OHIO}. 
Concatenating the two results, Dq * m, » {0010001110} • 

To ensure full disclosure of the aforedescribed preferred 
20 embodiment, a corresponding verilog listing of the error correction 
module 36 is provided as follows: 
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module decodert 



15 



constcik, 


/« positive edge byte clock 






decstart, 


start of decoding 






resetn, 


/* decoder negative reset 






syrsOOOO, 


syndrome 0 






syrsOOOl, 


l-k syndrome 1 




«/ 


5yrs0002, 


h syndrome 2 






syrsOOOS. 


syndrome 3 




iff 


syrs0004i 


/* syndrome 4 






syrsOOOS, 


syndrome 5 






syrsOOOe. 


/★ syndrome 6 




♦/ 


syrsOOO?, 


1* syndrome 7 




*/ 


locaterO, 


/* locater of first error 




*/ 


locaterl , 


/* locater of 2nd error 




*/ 


errvaluO, 


/* value of first error 


*/ 




errvalul , 


/* value of 2nd error 


*/ 




decrstat. 


decoder status flags 






finished 


status indicating finished 


♦/ 





); 



20 input constcik; 
input decstart; 
input resetn; 

input [9:0] syrsOOOO; 
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input 19:0] syrsOOOl; 

input 19:01 syrsOO02; 

input * I9:0J syrs0003; 

input (9:01 syrs0004; 

5 Input [9:0] syrsOOOS; 

input (9:01 syrs0006; 

input 19:01 syrsOOO?; 

output (9:01 locaterO; 

output [9:0] locaterl; 

10 output [9:0] errvaluO; 

ouput (9:0} errvakil; 

ouput (9:0) decrstat; 

output [1:01 finished; 



// synopsys state. vector st8 

parameter [6:0] /★ synopsys enum bus.states 



20 



st.OO 




7'hOO. 


s^Ol 




rhOI, st02 


« 7'h02, st03 


= 7'h03. 


st.04 




7'h04. 


st.05 




7*h05. st06 


= rh06, SL07 


= 7'h07, 


st.08 


s 


7'h08. 


st.09 




7'h09. st.Oa 


« rhOa, stOb 


= 7'hOb. 


st,Oc 




7'hOc, 


stOd 




7'hOd, st.Oe 


« 7*h0e, stOf « 7'hOf, 


St JO 




7'h10. 


stil 




7'h11. stJ2 


- 7'hl2, st13 


= 7'h13. 


st_14 




7'h14, 


SO 5 






« rhl6, s^l7 


= 7'h17, 


st.18 




7'h18. 


st.19 




7'hl9. st.la 


= 7'h1a, stib 


« 7'h1b, 


stjc 




7'h1c. 


s^ld 




7'hld, st.le 


= 7'h1e, st.lf = 


= 7'h1f, 
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st.20 « 7'h20. sUI « 7'h21, st,22 = 7*h22, st.23 = 7'h23, 

st.24 = 7'h24, st.25 = 7'h25, st.26 « rh26. s^27 « 7'h27. 

' st.28 - 7'h28, st,29 - 7'h29, st.2a « rh2a, st2b = 7'h2b. 

st.2c = 7'h2c, stL2d « 7'h2d, st.2e « 7'h2e. sxJ2i « 7'h2f, 

st.30 -» 7'h30, st,31 « 7'h31, $^32 » 7'h32, s^33. o 7'h33, 

st.34 = 7'h34, st.35 ^ 7'h35, st.36 = 7'h36, st37. « 7'h37, 

8t.38 = 7'h38. sU9 » 7'h39, st3a = rh3a, st3b, = 7'h3b, 

sUc » 7'h3c, st3d » 7'h3d, st.3e = 7'h3e, st.3f « 7*h3f, 

st.40 = 7'h40, 8^41 = 7*h41, 8^42 « 7'h42, st43 = 7'h43. 

st.44 = 7*li44. st.45 » 7'h45. st46 = 7'h46. st.47 = 7'h47, 

st.48 « 7'h48. 8^49 = 7'h49, st,4a = 7*h4z, st4b = 7*h4b, 

st.4c = 7*h4c. st4d - 7'h4d, st.4e - 7'h4e, st.4f « 7'h4f. 

st^SO « 7'h50, 8t.51 = 7'h51, st.52 = 7'h52, st53 = 7'h53, 

si.54 = 7'h54, SL55 = 7'h55, st.56 = 7'h56, 5^57 = 7'h57, 

15 st.58 = 7'h58, st.59 = 7'h59, st.Sa = 7'h5a, st5b « 7'h5b, 

s?.5c = 7'h5c. sL5d » 7'h5d. st.5e « 7'h5e. st.5f = 7'h5f, 

st.60 « 7'h60, 8^61 » rh61. 8^62 » 7'h62. st63 « 7'h63, 

st.64 = 7'h64. st.65 = 7'h65, st.66 « 7'h66. st67 « 7'h67, 

81.68 « 7'h68. st_69 = 7'h69. st.Sa - 7'h6a, s^6b = 7'h6b. 

st.6c •= 7*h6c. 8t6d = 7'h6d. st.6e « 7'h6ei sxjSi « 7'h6f. 

8t.70 = 7'h70. 8^71 » 7'h71, 8t.72 « 7'h72, 8^73 « 7'h73, 

8t.74 » 7'h74. 8^75 « 7'h75, 8^76 » 7'h76, 8t.77 = 7'h77, 

st.78 = 7'h78, 8t.79 « 7'h79. st7a = 7'h7a, st7b = 7'h7b, 

st.7c = 7'h7c. st.7d = 7'h7d, 8t.7e = 7*h7e, st.7f = 7'h7f; 



20 
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reg 16:01 Msynopsys enum bus.states ♦/ st8, nxstS; 

pararrieter [3:0] 

LrO = 4'dO, i.rl = 4'dl, Lr2 - 4'd2, i.r3 « 4'd3, 

i_r4 = 4'd4. Lf5 « 4'd5. \jB = 4*d6, Lr7 « 4'd7. 

5 i.sO « 4'h8, i.s1 = 4'h9. i.s2 = 4'ha, l.sS « 4'hb, 

ls4 o 4'hc, i_s5 « 4'hd. Ls6 « 4'h3, i.s? = 4'hf; 

parameter [2:0] 

load - 3'dO, accm « 3'd1, mutt «= 3'd2, invt « 3'd3, 

dbls « 3'd4, sqre = 3'd5, stat = 3'd6. noop « 3'd7, 

^0 O-fO - 3'dO, o.rl « 3'd1, o.r2 = 3'd2, o.rS - 3'd3, 

o.r4 = 3'd4, o.rB - 3'd5, o.r6 = 3'd6, o.r7 = 3'd7; 

regllOrOlfnk; 
wire 1 1:0) decrstat; 

wire [3:0] sourcadd; 
15 wire (2:0) instruct; 

wire [2:0] destnadd; 

wire finished: 

wire bz: 

wire nz; 
20 wire instatus; 
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assign bz » instatus; 
assign nz = -bz; 

assign sourcadd « fnk (10:7]; 
assign instruct « fnk 16:4]; 
5 assign destnadd « fnk 13:1]; 
assign finished » fnk [91; 



always @(st8 or bz or nz) 
case Ist8) // synopsys f ull.case parallel.case 



st.OO:begin nxstS-st.OI; fnk^p.sO.load.o.nj'dO}; end // SO intoRI muttipfer 

st.01 :begin nxstS = st.02; fnk = {i,s2,mult.o.rO, 1 'dO}; end // S0S2 into RO accumulator 

st.02:begrn nxst8=st.03; fnk«={i.s1, sqre,o.r2,rd0}; end // SU2 into R2 

st.03:begin nxst8=:st.04; fnk={4'h2,accfn,o.r2.rdO); end // S0S2 + S1**2 into R2 



/★ R2 now contains detIM2] */ 

l-k detlM2( is nonzero if there are two or more errors */ 

/♦ det(M2] is zero if there are one or more thatn 2 errors */ 



st.04:begin nxst8-({7{bz)}&st_4e)|({7{nz)}to05); fnk=(4'h3.load,o.r3.rd0); end // noop branch 
St 05:begin nxst8-st.06; fnk«{i.s3,multo.r0.Vd0}; end // S0S3 into RO 

accumulator 

St OBrbegin nxst8«=st.07: fnk={i.s1Joad,oj1.rdO}; end // SI into R1 
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multiplier 

St 07:b€oin nxst8«st.08; fnk = {l.s2.niulto.r3,rdO); end // S1S2 into R3 

St 08:beoin nxstS -st.09; fnk « {4'h3.accfn,o.r3. 1 'dO); end // SI S2 + S0S3 into R3 

R3 is the numerator of Lambda 1 
5 /♦ R3 is zero causes the two error solution to fail */ 

st.09:beoin nxst8«({7{bz)}&st.66)|({7{nz})&st.0a); fnk«(4'h3.load.o.r3.rd0); end // noop branch 
SLOa:beQin nxst8«st.0b; fnk«{i.s3,mult,o.rOJ'dO); end//S1S3 into RO 

accumtator 

st.Ob:beoin nxst8 « st.Oc: Ink = {i.s2.sqre,o.r4, 1 'dO); end // S2**2 into R4 

10 st.Oc:beoin nxst8 «st.Od; fnk « {4'h4,accm,o.r4, VdO); end // S2**2 + SI S3 Into R4 

/♦ R4 is the numerator of tambda2 

/* R4 is zero causes the two error solution to fail */ 

st.Od:beoin nxst8-({7{bz}}&st.66M({7{nz)}&st.0e): fnk«{4'h4,load,o,r4.rd0); end //nop branch 
st.Oe:beain nxst8 « st.Of; fnk {4'h2,in vt^o.rl , 1 'dO); end // inverse{det(M2l} into R1 

15 multiplie 

st.Of :beoin nx5t8 = st.1 0: fnk « {4'h3,multo.r3, 1 'dO); end // Lambdal into R3 

st.1 0:begin nxst8 -st.1 1 : fnk « {4'h4.mult.o.r4, 1 'dO}; end // Lambda2 into R4 

/« now we have equation of the form z**2 + Lambda2 e s 0 to solve W 



- 40 - 



wo 97/00559 



PCT/US96/10074 



stj 1 :begin nxst8«:stJ2; fnk«{4'h3,lnvt.o.r1,rd0}; end // 1/Lambdal 
multiplie 

stJ2:begin nxst8 = stJ3; fnk = {i.s1,mult,o.r7,rd0}; end // Sl/Lambdal 
5tJ3:begin nxst8-stJ4; fnk = {4'hl.sqre,o.f1,rdO); end // 1/{LambdaU*2) 
5 multiplie 

stJ4:beQln nxst8=st.1S; fnk-{4'h4.multo.n,rdO}; end // Lambda2/(Lambdal**2) 
multiplie 

/♦ now we have equation of the form v**2 + y + c 
/« where c » Lambda2/(Lambda1««) 
0 if one solution exists, we call it w 

stJ5:begin nxst8=st.16; fnk-{4'hl,dbls,ojl,rd0); end // w into R1 multiplier 
/♦ no such solution causes the two error solution to fail ♦/ 

stJ6:beoin nxst8o({7bz})&st.66)|({7{nz}}&stJ7); fnk«{4'h1Joad.o.r1J'dO); end // noop branch 
st.l7:begin nxst8=stJ8; fnk»{4'h3,mult.o.r4.rd0); end // Lambdal w = XO into R4 
5 stJ8:begtn nxst8-stJ9; fnk«{4'h4,load.o.rO,rdO); end // XO into RO accumulator 

St.19:beoinnxst8«stja;fnk={4'h3,accm,o.r5j'd0); end //Lambdal + XP = XI into R5 

/* now we have the error locators done, find error values */ 



into R1 

into R7 
Into R1 

into R1 



= « 0 to solve */ 
♦/ 

♦/ 
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stjarbegin nxst8«stjb; fnk = {1.50, mult, o.rO.I'dO}; end // SO w 

stJb:beo-rn nxstS-stJc; fnk = {4'h7.accm,o.r0.rd0}; end // SO w + Sl/Lambdal « Yl into RO 
/* a zero error value causes the two error solution to fall */ 

st.lc:beflln nxst8H{7{bz}}&st.66M{7{nz)}&stJd); fnk«4'hOJoad,o.r0.rdO); end // noop branch 
stJd:beoin nxst8«stjc; fnk-{i.s0.accm.o.r6J'dO}; end // SO + Yl - YO into 46 

/♦ zero error value causes the two error solution to fail ♦/ 

st.1e:beoin r«st8 = ({7(bz))&st.66)|({7(nz}}&st.1f); fnk-{4'h'6,load,o.r6J'dO}; end // noop branch 
st.lf:beoin nxst8-st.20; fnk»{4mioad,o.r7.rd0}; end // Yl moved to standard place, R7 

/♦ double error solution is now complete {R4. R5. R6. R7} = {XO, XI , YO, Yl) */ 
/♦ check the solution against all syndromes: ^/ 



st.20:begin nxst8«st,21; fnk«{4'h4.sqre.o.rl,rdO}; end //X0**2 
multipUe 

st.21:beQtn nxst8«st.22; fnk=:{4'h6,mult.o,r2.rd0}; end // X0**2 YO 

st.22:begin nxst8-st.23; fnk={4'h5.sqre.o.rl.1'dO); end // Xl**2 
multiplie 

st.23:bcgin nxst8-st.24; fnk={4'h7,mult,o.r3,rd0); end // Xl**2 Yl 

st.24:begin nxst8«st.25; fnk»{4'h3.load.o.rO.VdO); end // X1**2 Yl 
accumulat 



into R1 

into R2 
into Rl 

into R3 
into RO 
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st.25:beoin nxst8==st.26; fnk = {4'h2,accfn,o,rO,rdO}; end // XU*2 Yl + X0**2 YO into RO 
accumulat 

st.26:b80innxst8 = st.27; fnk = {i.s2.accm.o.r0,rd0); end // S2 + X1**2 Yl + X0**2 YO into RO 
a nonzero value causes the two error solution to fail 

5 st.27:begin nxstB = ({7{nz})&st.66) | ({7{b2))&st,28); fnk=4'hOJoad,o.rO,rdO}; end // noop branch 

5t.28:beoin nxst8«st.29: fnk«{4'h4,load,o.rl,1'd0}: end // XO into Rl 
multiplie 

st.29a>egln nxstS = st.2a; Ink = {4'h2.mult,o.r2.TdO}; end // X0**3 YO into R2 

st,2a:beoin nxst8=st.2b; fnk«{4'h5Joad,o.rl.rdO}; end // XI into Rl 
10 multipRe 

st^2b:beoin nxstS ^ st.2c; f nk = {4'h3,mutt,o.r3.1 'dO}; end // X1 **3 Y1 into R3 

st.2c:bcoin nxst8s=st.2d; fnk=»{4'h3,load.o.r0,rd0); end // XU*3 Yl into RO 
accumuiat 

st.2d:beoin nxst8 = 5t.2e; fnk = {4'h2,addm,oj0,rd0); end // Xl**3 Yl + X0**3 YO into RO 
15 accumulat 

st.2c:bcQinnxst8=*st.2f; fnk = {i.s3,accm,o.r0.rdO); end//S3 + X1**3Y1 + X0**3 YO intoRO 



/* a nonzero value causes the two error solution to fail «/ 



st.2f:begin nxst8«=({7nz}}&st_66|({7{bz))&st_30); fnk=4'h0.load.o.r0.rd0); end //noop branch 
st_30:beoin nxst8=st.31; fnk = {4'h3.fnult.o.r3,rd0}; end // XU*4 Yl into R3 

st.3l:begin nxst8 = st.32; fnk = {4'h4Joad,o.rl,rd0}; end // XO into Rl 
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multiplie 

st.32:bcoin nxst8=st.33; fnk = {4'h2,mult,oj2J'dO}; end // X0**4 YO into R2 

st.33:beBin nxstS =st.34; fnk- {4'h3,load,o.r0, VdO}; end // X1**4 Yl into RO 

accumulat 

5 st.34:beoin nxst8=st.35; fnk=:{4'h2,accm,o.r0,rd0}; end // X1**4 Yl + XO*^^ YO into RO 
accumulat 

st.35:be0in nxst8ost.36: fnk-{Ls4,accm,o.rO,rdO}; end // S4 + Xl**4 Yl + X0**4 YO into RO 
/* a nonzero value causes the two error solution to fail V 

st.36:bcoin nxst8 = ((7{nz}}&st.66|{{7{b2})&st.37); fnk»{4mioad,ojO,VdO); end //noop 
10 st.37:beoin nxst8«st.38; fnk-{4'h2,mult.o.r2,l'd0); end // X0**5 YO into R2 

st.38:begin nxst8-st.39; fnk={4'h5Joad.o.rl,Vd0); end // XI into R1 

multiple 

st.39:beQin nxst8 «st.3a: fnk - {4'h3,mult,o.r3J'd0}; end // XU*5 Yl into R3 

st.3a:beoin nxst8=st.3b; fnk = {4'h3,load,o.r0,rd0}; end // XU*5 Yl into RO 

15 accumulat 

st.3b:beoin nxst8 = st,3c; fnk"{4'h2,accm,o.r0,rd0}; end // Xl**5 Yl + XO^S YO into RO 
accumulat 

st,3c:beoin nxst8«st.3d; fnk»{i.s5,accm,o.rO.VdO}; end // S5 + XU*5 Y1 + X0**5 YO into RO 
M a nonzero value causes th two error solution to fail «/ 
20 st.3d:begin nxst8 = ({7nz})&st.66|({7{bz})&st.3e); fnk = {4'hO,load,o.rOJ'dO}; end // noop branch 
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sC3e:beoin nxst8-st.3f; fnk»{4'h3,mult.o.r3,VdO); end // XU*6 Yl into R3 

st.3f:beoin nxst8 = st,40; fnk = {4'h4,load,o.rl,rdO); end // XO into Rl 

multiplie 

st.40:beoin nxstS -st.41 ; fnk = {4'h2,mult,o.r2.rd0}; end // X0**6 YO into R2 

st.41 :begin nxst8=st.42; fnk«{4'h3Joad,ojai'dO}; end // Xl**6 Yl into RO 

accumulat 

st.42:be9in nxst8 = st_43; fnk-{4'h2,accnn,o.r0.rd0); end // Xl**6 Yl + X0**6 YO into RO 
accumulat 

st.43:beoin nxst8-st.44; fnk-{i.s6,accm.o.r0.rd0}; end // S6 + XU*6 Yl + X0**6 YO into Ro 
/* a nonzero value causes the tow error solution to fall ♦/ 

st,44:beoln nxst8 = ({7{n2}}&st.66) | ({7{b2)}&st.45); fnk « {4'hO Joad.o.rO, 1 'dO}; end // noop branch 
s^45:bcoln nxst8 -st.46; fnk«{4'h2,mult,o.r2, VdO}; end // X0**7 YO into R2 

st.46:beoin nxst8 =st^47; fnk « {4'h5 Joad,ojl .VdO); end // XI into Rl 

multiplie 

st.47:beoin nxstS «st.48; fnk«{4'h3,mult,o.r3.rd0); end // XU*7 Yl into R3 

st.48:begin nxst8«st.49; fnk«{4'h3,load,o.rard0); end // Xl**7 Yl into RO 

accumulat 

st.49:begln nxst8-st,4a; fnk-{4'h2.accm,o.r0.rd0}; end // XI -♦7 Yl + X0**7 YO Into RO 
accumulat 

sL4a:beoin nxst8=st.4b; fnk«{Ls7,accm.o.rO,rdO); end // S7 + XU*7 Yl + X0**7 YO into RO 

/* a nonzero value causes the two error solution to fail */ 
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st.4b:begln nxstS = ({7{n2}}&st.66({7{bz}}&st 4c); fnk «= {4'h0,load.o.r0,rd0}; end // npop branch 
st.4c:beoln nxst8 = st.4d; fnk-{4'h2,stat,o.f2,rd0); end // set status register to 2 errors 
st.4d:beoin nxstS - st.69; fnk - {4'h2,load.o,r2, T 'dO}; end // noop unconditional branch 

/* start single error decoding 

st.4e:beoin nxstS « st.4f ; fnk « {i.s0.invt.o^rl . 1 'dO); end // 1 /SO into R1 

muttipiie 

/« a zero value causes the single error solution to fail */ 

st.4fcbegin nxst8«({7{b2))&st.66)|({7{nz})&st.50); fnk«{4'h1Joad,o.rr VdO}; end //noop branch 
st.50:begin nxst8-=st.51; fnk = {i.s1,mult,o,rl,rdO}; end // SI/SO » XO into R1 

/* a zero value causes the single error solution to fail */ 

st.51:begin nxstS = ({7{b2}}&st.66)| ({7{nz})&st.52); fnk-{4'h1 Joad.o/I.I'dO); end // noop branch 
st.52:beoin nxst8=st.53; fnk={i.sl.mult.o,rO,rdO); end // SI XO into RO 

accumulat 

st.53:begin nxstS = st.54; fnk - {i.s2,accm.o.rO. 1 'dO}; end // SI XO + S2 into RO 

accumulat 

/♦ a nonzero value causes the single error solution to fail ♦/ 
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st,B4:beoin nxst8«({7(nz})&st.66H({7{nz}}&st.55); fnk = {4'h1Joad,o/1.1'dO}; end //noop branch 
st.55:beoin nxst8«st.56: fnk = {i.s2.mult,o,rO. TdO); end // S2 XO into RO 

accurhutat 

st.56:begin nxstS^st^S?; fnk«»{j.s3.accm,o.rOJ'dO}; end // S2 XO -f S3 into RO 

5 accumulat 

/* a nonzero value causes the single error solution to fail 

st^57:begin nxst8»({7{nz}}&st.66)|({7{bz})&st.58); fnk»{4'h1Joad,o.r1J'dO): end // noop branch 
st^SSibegin nxst8"St.59;.fnke{i.s3,mu1t,o.rOJ'dO}; end // S3 XO into RO 

accumulat 

10 st.59:begin nxst8e:st.5a: fnk<={i.s4.accm,o^rOJ'dO}: end // S3 XO -i- S4 into RO 

accumidat 

a nonzero value causes the single error solution to fail */ 

st.5a:begin nxst8«({7{nz)}&st.66|({7bz}}&st.5b); fnk = {4'h1,load,o.rl,rdO); end // noop branch 
st.5b:b8gin nxst8«st.5c; fnk»{i.s4,mufto.r0.rd0}; end // S4 XO into RO 

15 accumulat 

st.5c:begin nxst8=st.5d; fnk«{i.s5,accm,o.rO.VdO}; end // S4 XO +S5 into RO 

accumulat 

/« a nonzero value causes the single error solution to fail *f 
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st.Bdrbeoin nxst8«{{7{ni}}&st.66|({7{bi))&st.5e); fnko{4'h1,load,oj1.1'd0); end // noop branch 
st.5e:beoin nxst8«st.5f; fnk={Ls5.mult,o.r0.rdO}; end // S5 XO into RO 

accuniulat 

st.5f:beoin nxst8«st.60; fnk«{i.s6.accm,o.fOJ'dO): end // S5 XO + S6 into RO 

5 accumulat 



a nonzero value causes the single error solutbn to fail */ 



st.60:beoin nxst8«({7{n2}}&st.66|((7bz}}&st,61); fnk»4'h1,load,o.rl.VdO}; end // noop branch 

s^SI :begin nxstS = st.62; f nk = {Ls6,nujlt,o.r0.1 'dO); end // S6 XO into RO 

accumulat 

st.62:beoin nxst8=st.63; fnk=s7,accnfi,o.r0,rd0}; end // S6 XO + S7 into RO 

accumulat 



/i* a nonzero value causes the single error solution to fail «/ 

s^63:begm nxst8«({7{nz}}&st.66M{7{bz})&st.64; fnk-{4'h1,load.oj1,VdO); end // noop branch 

st.64:beoin nxst8«st.65; fnk«{4'h1,stato.r1.VdO): end //set status register to 1 error 

st.65:begin nxst8=st.67; fnk=={4*hl,load,o.r1.rd0); end // noop unconditional branch 

st.66:beoin rixst8-st.69; fnk«(4'h0.stat,o.r0.rd0); end // decoder failure: set status to 0 

st.67:beoin nxst8«st.68; fnk«{l.r1Joad,o.r4,rdO}; end // load r1 value into r4 iocater output 

st.68:begin nxst8=st.69; fnk = {i.sOJoad,o.r6,VdO}; end // load SO value into r4 iocater output 

st.69:beoin nxst8=st,69; fnk = {4'hO,load,o.r0.rdl}; end // decoder complete 
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endcase 

always @ (posedge constclk or negedge resetn) 
begin 

if (iresetn) 

st8 < « s^69; 
else if (finished && decstan) 
st8 < « 7'dO; 

else 

st8 < e nxstB: 

end 

gf.alu niylstalu( 

•resetn (resetn), 

.syrsOOOO(syrsOOOO), 

•syrsOOOl (syrsOOOl ), 

^yri0002(syrs0002), 

.syrs0003<syrs0003), 

.syrs0004(syrs0004), 

. syrsOOOS (syrsQOOS ) , 

.syrs0006(syrs0006), 

.syrs0007(syrs0007). 

.locaterO(locaterO), 

.locater! (locaterl 
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.errvaluO(errvaluO), 

.errvaluKenvalul), 

.decodclk(constclk), 

.tnstruct(instruct), 

.sourcadd(5ourcadd)« 

.destnadd(de5tnadd), 

.decrstat(decrstat), 

.instatusfinstatus); 

endmodule 



15 



20 



{syrsOOOO, 


/* syndrome 0 




*/ 


syrsOOOl , 


/♦ syndrome 1 




♦/ 


SYrsO002. 


syndrome 2 




*/ 


syrs0003. 


h syndrome 3 




*/ 


SYrs0004, 


/« syndrome 4 




*/ 


syrsOOOS, 


syndrome 5 




*/ 


syrsOOOe, 


h syndrome 6 




*/ 


syrs0007« 


/♦ syndrome 7 




*/ 


tocaterO, 


/* register 4 of ALU 


*/ 




locaterl , 


register 5 of ALU 


★/ 




errvaluO, 


/* register 6 of ALU 


♦/ 




errvalul, 


A register 7 of ALU 


*/ 




decoddk« 


alu positive edge clock 20 Mhz 


*/ 




instruct. 


/« insuuction to be executed 


*/ 
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10 



15 



input 
input 
input 
input 
input 
input 
input 
input 
input 
input 
input 
input 
input 



[9:0] 
(9:0) 
f9:0] 
(9:0) 
[9:01 
t9:0] 
I9:0J 
(9:01 

(2:0) 
(3:0) 
(2:0) 



sourcadd, 
destnadd, 

instatus 

); 

resetn; 

syrsOOOO; 

syrsOOOl ; 

syrs0002; 

syr$0003; 

syrsOOOA; 

syrs0005; 

syrs0006; 

syrsOOO?; 

decodcik; 

instruct; 

sourcadd; 

destnadd; 



/« address of source operand 
address of destination operand 
decrstat, In status of decoder 

/« status of completed instruction 



*/ 



output (9:01 locaterO: 

20 output (9:01 locaterl; 

output (9:0] errvaluO; 

output (9:01 errvalul; 

output (1:0J decrstat: 
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5 



10 



15 



output 




instatus; 


//reo 




tnstatus; 


reg 


19:0] 


workreoO; 


reo 


11:0) 


decrstat; 


reo 


19:0) 


workreol: 


reo 


19:0) 


workreo2; 


reo 


[9:0) 


workreo3; 


reo 


(9:0) 


workreo4; 


reo 


f9:0) 


workreoS; 


reo 


19:0) 


worlcreo6; 


reo 


19:0) 


workreo7; 


reo 




instatus; 


reo 


[9:0) 


sourcer; 


reo 


[9:0] 


ahiout; 


reo 




status; 



// instruction 0 destination < « source 



20 



wire 


[9:01 


accumuf; 


// instruction 1 destination < = 


sourcer accumulated w output 0 


wire 


19:0] 


product; 


// instruction 2 destination < « 


sourcer multiplied by output 1 


wire 


(9:0) 


inverse; 


// instruction 3 destination < » 


inverse of sourcer 


wire 


(9:0) 


dblsoln; 


// instruction 4 destination < « 


soln of {vm2 -i- y + sourcer - m 


wire 


19:0) 


squared; 


// instruction 5 destination < s 


sourcer«*«2 



0) 



// instruction 6 no operation 



- 52 - 



wo 97/00559 



PCT/US9d/10074 



// instruction 7 no operation 

wire iszero: 

wire noroot; 

wire statenab; 

wire 17:01 reoenab; 

assign locaterO « worl(reg4; 
assign locaterl « workregS; 
assign errvaluO = workregG; 
assign errvalul « workreg?; 

/★ get source operand «/ 

always @(sourcadd or 

workregO or workreg 1 or workreg2 or workregS or 
workreg4 or workregS or workregB or workreg7 or 
syrsOOOO or syrsOOOl or syrs0002 or syrs0003 or 
syrs0004 or syrsOOOS or syrsOOOe or syr50007) 

begin 

case (sourcadd) 
4'dO : sourcer = workregO; 
4*d1 : sourcer « workreg 1; 
4'd2 : sourcer « workreg2; 
4'd3 : sourcer « workregS; 
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4'd4 : sourcer « workregA; 
4*d5 : sourcer = workregS; 
4'd6 : sourcer = workrcgS; 
4'd7 : sourcer = workreg7; 
4'd8 : sourcer « syrsOOOO: 
4'd9 ; sourcer = syrsOOOl; 
4'd10 : sourcer « syrs0002; 
4'd11 : sourcer = syrsOOOS; 
4'd12 : sourcer » syrs0004; 
4'dl3 : sourcer ^^.syrsOOOS; 
4'd14 : sourcer « syrsOOOG; 
4'd15 : sourcer « syrs0007; 
endcase 

end 



/* computation units */ 



assign accumul « workregO " sourcer; 

multiply decmultrCxIworkregl ), .y(sourcer), .z(product»; 

decinvnl.yisourcer). .zCinverse)); 
dbltrbl dectablet.cteourcerh .y(dblsolnI, .noroot)); 

squarer decsquar(.x(sourcer), .2(squarcd)); 



/* assign alu output and status */ 
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always @ (instruct or 
sourcer or 
accumui or 
product or 
inverse or 
dbtsoln or 
squared or 
noroot or 
iszero) 

begin 

case (instruct) 
3'dO : begin aluout « sourcer; status « iszero; end 
3*d1 : begin atuout = accumut; status = iszero; end 
3*d2 : begin aluout = product; status = iszero; end 
3'd3 : begin aluout « inverse; status « iszero; end 
3'd4 : begin aluout = dblsoln; status « noroot; end 
3'd5 : begin aluout = squared; status « iszero; end 
3'd6 : begin aluout « sourcer; status « VdO; end 
3'd7 : begin aluout « 9'dO; status = VdO; end 

endcase 

end 

/•* assign iszero as zero output of alu */ 
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assign iszero » -(| aluout); 

M assign output register enables W 



assign statenab = (Instruct = - 3'd6); 



assign regenab[0] 


s 


(destnadd 


B B 


3'dO); 


assign regenab(l) 




(destnadd 




3'd1); 


assign regenab(2J 




(destnadd 


sa B 


3'd2); 


assign regenabP] 




(destnadd 


B B 


3'd3); 


assign regenab(41 




(destnadd 


B B 


3'd4); 


assign regenabIS] 


B 


(destnadd 


a ms 


3'd5); 


assign regenab[6I 




(destnadd 


B S 


3'd6); 


assign regenab[7] 




(destnadd 


B B 


3'd7); 



/* update outputisters W 

always @ (posedge decodclk) 
begin 

//Note: add reset for workrcgO only, since a known value is 
//needed here during initialization, 
if (resetn » « 0) 

workregO < = 10'hOOO; 

else 
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workreoO < - ({lOfreoenaWOJ}) & aluout | (flOf^regenaWOl}) & workregO) 
workregl <« «10{reoenabn)}) & aluout | ((10{^r8oenab[1]}} & workregl): 
workreo2 < = ({10{reoenab[2J}) & aluout | (n0{-.regenab(2J)) & workreg2); 
workreg3 < « ({10{regenab[3]}} & aluout | (I10{-regenabl3J)) & workregS); 
workregA < « ({10{regenab[4)}} & aluout | «10{-.regenabl4)}} & workreg4); 
workreo5 < « l{10{rcgenabl5J}} & aluout | ((10{-reoenab[53)} & workregBJ; 
workregO < - ({10{regenabI6)}} & aluout | «10{-regen3b[61)} & workreg6); 
workreg? <- ({lOfregenabf?})) & aluout | CllOf-regenabt?]}) & workreg?); 
mstatus < B status; 

if (statenab) 

decrstat <» sourcadd(1:0]; 

endmodule 

module multipty y, 2); // gaiois field multiplier of x and y to get z 

15 input (9:01 x; 
Input [9:01 y; 
input 19:0J 2; 

wire 14:0) IsbO; 
wire 14:01 msbO; 



10 



end 
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wfref4:0J Isbl; 
wireC4:0J msbl; 
wire 14:01 msbOmsbl; 
wireI4:0J IsbOlsbl; 
wire C4;01msb01sbl; 
wif« (4:01 msbllsbO; 
wire (4:0) pOmOmI; 
wire (4:0] plmOml; 

assign IsbO ■> x(4:0]; 
assion Isbl « y[4:0]; 
assign msbO - x(9:5J; 
assign msbl ^ v(9:5]; 

gatemulS submO( .xdsbO), .ydsbl), .zdsbOlsbl)); 
gatemulS submK .x(msbO). .yimsbl), ^(msbOmsbl)); 
gatemufS subm2( .xflsbO), -Wmsbl), ^(isbl UbO)); 
gatemulS subm4( .x(msbO). .y(1sb1|, ^(msbOlsbl)); 
assign pOmOmI « msbOmsbl; 
assign pi mOmI = msbOmsbl; 

assign 2(4:0] - IsbOlsbl ^pOmOml; 

assign 2(9;5) « msbOlsbl * msbl IsbO ^ plmOml; 
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endmoduie 

modufe oatefnul5(x, y, 2); // gaiots field multiplier of x and y to get 2 

input 14:0) x; 
input (4:01 v; 
input 14:0) 2x; 

wire (4:0J alphaOx; 
wire (4:01 alpha 1x; 
wire (4:01 alpha2x; 
wire (4:01 alpha3x; 
wire (4:0) alpha4x: 

assign alphaOx « x; 

assign alphalx - {alpha0xl3:0],rb0} * ({5{alphaOx[4)}} & 5'd5); 
assign alpha2x « (alphalx[3:0JJ'b0) * <{5{alpha1x(4))} & 5'd5); 
assign alphaSx = {alpha2xI3:OJ,rbO} ^ ({5{alpha2x(4))) & 5'd5); 
assign alpha4x = {a!pha3x(3:0!,1 'bO) * ({5{alpha3x(4)}} & 5'd5}; 

assign zIO] ^ 

(alphaOx(O) & yfOIl * 
(alphalx(O) &y[1j)- 
(alpha2xI0) & yI21) * 
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Ulpha3xtO] & yPI) " 
Ialpha4xl0] & yI4]): 

assign z[1) 

(alphaOxd) & vfOll" 
(alphalxlU & yfl)) ^ 
(alpha2x(1) &y(2]r 
(alpha3x[1] &y[3)]^ 
(alphaAxIII &y(4]): 

assign z(2| » 



10 



(alpha0x(2] & y[0]) " 
(aipha1xC2) &y[1])^ 
(alpha2x[2] & y(2)) ^ 
(alpha3xl2| & yl31) " 
(atpha4xt2) & y(4]); 



15 assign z[3] » 

UlphaOxrS) & ytOl) * 
<alpha1x[31 & ylll) " 
(a!pha2x[3) & yf2J) " 
(alpha3x(3] & y(3]) " 

20 (alpha4xI31 & yt41): 
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assign z[4) *> 

(alpha0x(4] & v(On " 
(alpha1xI4) &y[in* 
(alpha2x[4) & vt2]l " 
(alpha3xI4) & y[31l " 
(alpha4xI4] & y(4)); 



endmodute 



module squarerix, z); 
input (9:0] x; 
output 19:0] z; 

assign 2 « <{10{xC0J}} & lO'dl) 
({10{x(1]}} & 10'd4) 
({10(x(2)}} & lO'dl 6) 
<{10{x[3J)) & 10'dlOJ 
({10{xl4))} & lO'dl 3) 
C{10{x(5)}} & 10'd33) 
({10{x[6l}} & 10'dl32) 
({10{xf7J}) & 10'ce28l 
({10{xl8))} & 10'd330) 
({10{x(9]}} & 10'd429} 

endmodute 
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module inverse (y. 2); // oalois field inverter of v to get 2 

input • [9:0) y; 
input [9:0] z; 

wire [4:01 Isb; 
5 wire [4:0) n«b; 

wire (4:0) Ublsb; 
wire [4:0) msbmsb; 
wire [4:0] Isbmsb; 
wire (4:0) plmsb; 
10 wire (4:0) denom; 
wire (4:0) numer; 
wire (4:0) pOmsbmsb; 
wire (4:0) pi Isbmsb; 
wire (4:0) litlinvt; 

15 assign Isb = v(4:0]; 

assign msb » y[9:5]: 

gatemulS submO( .xdsb). .ydsb). .zdsbOlsbl); 
gatcmulS submll .x(msbl, .y(msb), .z[msbmsW); 
assign plmsb » msb; 
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oatemutS subm4( .x(plmsb), ,y(1sbl. .2(pl Isbmsb)); 
assign pOmsbmsb = msbmsb; 

assign number = isb "pimsb; 

assign denom « Isblsb * pOmsbmsb *p11sbmsb; 

5 of32inv invO(.x(denom), .z<lltlinvt)); 

gatemulS subm5( .xMltllnvt). .yinumer), .2(14:0))); 
gat6mu!5 subm6( .x(1it1lnvt)« .yimsbh .z(z[9:5})); 

endmoduie 

module gf32inv(x« 2); 
10 input 14:0) x; 

output 14:01 2; 

reg (4:0) 2; 
always @(x) 



begin 



15 



case (x) 



S.bOOOOO : 2 



5'bOOOOO; 



5,b00001 : 2 



S'bOOOOl 



S.bOOOlO : 2 



5'blOOIO; 



S.bOOOl 1 : 2 



S'blllOO: 
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5,b00100 : z 
5,b00101 :z 
5,b00110:z 
5,b001 1 1 : z 
5,bO10O0 : z 
5,b01001 : 2 
S.bOTOlO : 2 
5,b010n : 2 
5,b01100 :z 
S.bOIIOI :z 
5,b01110 : z 
5«b01 11 1 : z 
5,b10000 : z 
S.blOOOl : z 
5«b10010 : z 
5,b1001 1 : z 
5«b10100 : z 
5,bl0101 : z 
5,b10110 : 2 
S.blOni : 2 
5,bn000 : 2 
5,b11001 :z 
S.b11010 : z 
5,b110n : z 



- 5'bOIOOI; 
o 5'blOIII; 
« 5'bOinO; 

5'bOnOO; 
« 5'blOIIO; 
«= 5'bOOIOO; 
= 5'bllOOI; 
= 5'blOOOO; 
-= 5'bOOIII; 
« 5'bOllll; 
«= 5'bOOIIO; 
= 5'bOIIOI; 
• 5'bOIOn; 
« 5'b11000; 

- 5'bOOOIO; 
« 5'blllOI; 
» 5'bllllO; 
= 5'bnOlO; 
= 5'b01000; 
= 5'bOOIOl; 
= 5'blOOOl; 
« 5'bOlOIO; 
•= 5'blOIOI; 
= 5'b11111; 
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5,b11100 : 2 « 5'bOOOII; 
5,bni01 : z « 5'blOOll; 
5,b11110 : z = 5'blOIOO; 
5,b1im : z ^ 5'bnOII; 
endcase 
end 
endmodule 

module dbltrbt (c, y, noroot); 

input [9:0] c; 

output (9:0 y: 

output noroot; 

assign noroot « cl8J " c(5]; 



assign y(90] 




VdO 








assign yll] 




cI7J * 


cC4J * 


c[21; 




assign y[2] 


K 


cI7J " 


c[6I ^ 


c(4I * 


cr3); 


assign y[3I 




cfSJ * 


c{6] - 


Cl4) * 


c(2J * dll; 


assign yf7] 


« 


c[9] ^ 


c(5]; 






assign y[9] 




c[5J; 








assign yf6) 




c[9I - 


cl7J; 






assign y(8] 




y(6j - 


c[6J; 






assign y|4] 




y(6j * 


crei * 


cf5) * 


cf3J; 


assign y(5) 




V[8I * 


cl3I * 


cfO); 




endmodule 
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10 



15 



20 



Thus, an error correction methodology and system architecture 
for implementing a lo-bit Reed-Solomon code has been disclosed. 
While the foregoing detailed description was directed to a 
semiconductor based data storage and retrieval system, it will be 
apparent to those skilled in the art that the described error 
correction methodology and architecture can be effectively 
practiced with any digital data storage system, e.g., such as 
magnetic or optical based memory systems, as well as with any 
system involving the handling, transfer, and/or storage of blocks 
of digital data between elements thereof, including digital 
communications systems, where the correction of data being 
transmitted or stored is required. While the present invention is 
particularly well suited for systems and architectures encountering 
non-bursty, substantially random data errors, it may be equally 
employed in those systems and architectures encountering bursty 
error patterns, as well. 

Thus, it would be apparent to those skilled in the art that 
many more modifications are possible without departing from the 
inventive concepts herein. The invention, therefore is not to be 
restricted except in the spirit of the appended claims. 
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What is claimed is: 
1 1. A data storage and retrieval system for receiving, 

storing and retrieving, respectively, digital data paketized in 
3 blocks of k in-bit data symbols, comprising: 

means for reformatting the blocks of m-bit data symbols into 
5 blocks of 10-bit data symbols; 

an encoder having means for generating r lo-bit error check 
symbols from each reformatted block of 10-bit data symbols; 

means for reformatting said r 10-bit error check symbols 
generated by said encoder into {10/m)*r m-bit error check symbols; 

means for appending said (10/m)*r m-bit error check symbols to 
the respective m-bit data block from which they were generated; and 
data storage means for receiving and storing, respectively, 
said m-bit data blocks, including the respective appended (10/m)*r 
14 m-bit error check symbols - 



2 



4 
5 
6 
7 
8 
9 
10 
11 
12 
13 



1 2. The data storage and retrieval system of claim 1, further 

2 comprising means for retrieving said stored m-bit data blocks from 
said data storage means and reformatting them into respective 10- 
bit code words, each retrieved 10-bit code word comprising k 10-bit 
data symbols and r 10-bit error check symbols. 
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3. The data storage and retrieval system of claim 2. further 
comprising 

• a syndrome generator for generating r lo-bit error syndromes 
from each retrieved 10-bit code word; 

means for detecting the existence of at least one non-zero 
error syndrome in a given set of generated error syndromes; and 

error correction circuitry comprising: 

means for receiving said r lo-bit error syndromes from said 
syndrome generator when at least one non-zero syndrome is detected, 

a dedicated Galois field arithmetic logic unit 
having means for executing arithmetic operations on 10- 
bit symbols by performing extension field operations over 
a Galois field of 2*, 

a programmed controller for providing state 
instructions to said arithmetic logic unit which carry 
out a programmed error correction routine for determining 
the location of, and correction value for, respectively, 
up to 2 erroneous 10-bit data symbols in a given 
retrieved 10-bit code word. 



4. The data storage and retrieval system of claim 3, said 
dedicated Galois field arithmetic logic unit utilizing a Galois 
field GF(2^) generated from the irreducible polynomial g(x) « + 
x2 + 1, over GF(2) • 
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5. The data storage and retrieval system of claim 4, said 
dedicated Galois field arithmetic logic unit further generating the 
Galois field GF(2»°) from the irreducible polynomial P,o24(y) = y^ + 
y + 1, over GF(2^) • 

6. The data storage and retrieval system of claim 5, said 
Galois field GF(2'°) having as its primitive element (lOl)h. 

7. The data storage and retrieval system of claim 3, said 
data storage means characterized in that data errors occurring in 
a block of data stored therein occur in substantially random bit 
locations. 

8. The data storage and retrieval system of claim 3, said 
data storage means comprising a semiconductor based memory. 

9- The data storage and retrieval system of claim 8, said 
semiconductor based memory comprising a FLASH random access memory. 

10. The data storage and retrieval system of claim 3, said 
encoder combined with said syndrome generator in a single circuit. 
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1 
2 



4 

5 
6 



11. A data storage and retrieval system for receiving, 
storing and retrieving, respectively, data packetized in blocks of 
3 a-blt data symbols, comprising: 

an interface circuit for receiving the blocks of m-bit data 
symbols from an external data processing system; 

a first buffer for temporarily storing the blocks of m-bit 

7 data symbols from said interface circuit; 

8 means for reformatting said blocks of m-bit data symbols 
received from said interface circuit into blocks of k lo-bit data 

10 symbols ; 

11 an encoder circuit having means for generating r lo-bit error 
check symbols from each block of k lo-bit data symbols; 

means for reformatting said r 10-bit error check symbols 
generated by said encoder into (10/m)*r m-bit error check symbols, 
wherein said (10/m)*r m-bit error check symbols are appended in 
said first buffer to the respective m-bit data block from which 



9 



12 
13 
14 
15 
16 



17 they were generated; 
18 
19 



20 
21 
22 
23 
24 



data storage means for receiving and storing, respectively, 
said m-bit data blocks, including the respective appended m-bit 
error check symbols from said first buffer; 

means for retrieving said stored m-bit data blocks frbm said 
data storage means and reformatting them into respective 10-bit 
code words, each retrieved lO-bit code word comprising k 10-bit 
data symbols and r 10-bit error check symbols; 
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25 a syndrome generator for generating r lo-bit error syndromes 

26 from each retrieved 10-bit code word; 

27 • means for detecting the existence of at least one non-zero 

28 error syndrome; and 

29 error correction circuitry comprising: 

means for receiving said r 10-bit error syndromes 
said syndrome generator when at least one non-zero 
32 syndrome is detected, 

a dedicated Galois field arithmetic logic unit 
34 having means for executing arithmetic operations on lo- 

ss bit symbols by performing extension field operations over 

36 a Galois field of 2^ 

^"^ a programmed controller for providing state 

38 instructions to said arithmetic logic unit which carry 

39 out a programmed error correction routine for determining 
the location of, and correction value for, respectively, 
up to 2 erroneous 10-bit data symbols in a given 

42 retrieved 10-bit code word. 

1 12. The data storage and retrieval system of claim 11, said 

2 dedicated Galois field arithmetic logic unit utilizing a Galois 

3 field GF(2*) generated from the irreducible polynomial g(x) = x^ + 

4 x^ + 1, over GF(2) . 
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1 13. The data storage and retrieval system of claim 12, said 

2 dedicated Galois field arithmetic logic unit further generating the 

3 Galdis field GF(2*^) from the irreducible polynomial PjowCy) = y^ + 

4 y + 1, over GF(2^), and having as its primitive element (lOl)h 

1 14. The data storage and retrieval system of claim li, 

2 further comprising a second buffer for receiving and temporarily 

3 storing, respectively, said retrieved m-bit data blocks from said 

4 data storage means. 



1 15. The data storage and retrieval system of claim 11, 

2 wherein r = 8, 

1 16. The data storage and retrieval system of claim 11, 

2 wherein (10/m)*k « 256, m = 16, and r = 8, respectively. 

1 17, The data storage and retrieval system of claim ll, 

2 wherein (10/m)*k « 512, m = 8, and r = 8, respectively. 
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6 



18. In a data processing system, error correction apparatus 
for detecting and correcting data errors, comprising: 

• means for receiving r lo-bit error syndromes generated from a 

4 10-bit Reed Solomon code word; 

5 a dedicated Galois field arithmetic logic unit having means 
for executing an error correction algorithm by performing 

7 arithmetic operations on 10-bit symbols; 

8 a programmed controller for providing state instructions to 

9 said arithmetic logic unit which carry out said error correction 
10 algorithm. 

1 19. The error correction apparatus set forth in claim 18, 

2 said dedicated Galois field arithmetic logic unit generating a 

3 Galois field GP(2'<») using the irreducible polynomial P,„,(y) - yJ + 

4 y + 1, over a Galois field 2», said Galois field 2» being generated 

5 using the irreducible polynomial g(x) - x' + + i, over gf(2). 



1 
2 



20. The data processing system in claim 18, further 
comprising a semiconductor based digital data storage and retrieval 
sub-system. 
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1 21. In a data processing system, error correction apparatus 

for detecting and correcting data errors, comprising: 

an arithmetic logic unit having means for executing an error 
correction algorithm on 10-bit error syndromes, said arithmetic 
logic unit utilizing a Galois field Gr(2') generated from the 
irreducible polynomial g(x) = + + i, over GF{2); and 

a programmed controller for providing state instructions to 
said arithmetic logic unit which carry out said error correction 
9 algorithm. 



1 

2 
3 



22. The data storage and retrieval system of claim 21, said 
arithmetic logic unit further utilizing a a Galois field Gr(2'®) 
generated from the irreducible polynomial Pio24(y) = y' + y + 1, over 



4 GF(2^). 
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